Inducing Relatedness Graphs for Data Integration
نویسندگان
چکیده
In this paper, we present the AbsMatcher system for schema matching which uses a graph based approach. AbsMatcher creates a graph of related attributes within a schema, mines similarity between attributes in different schemas, and then combines all information using the ABSURDIST graph matching algorithm. The focus of this paper is on methods for generating relationships which are semantic in nature, but only require a simple data model. These relationships sources provide a baseline to be used when no others are available. Simulations demonstrate how the use of automatically mined graphs of within-schema relationships, when combined with cross-schema pair-wise similarity, can result in matching accuracy not attainable by either source of information on its own.
منابع مشابه
High relatedness and inbreeding at the origin of eusociality in gall-inducing thrips.
Within the haplodiploid eusocial gall-inducing thrips, a species-level phylogeny combined with genetic data for five eusocial species enables an inference of levels of relatedness and inbreeding values for lineages at the origin of eusociality. Character optimization using data from five eusocial species indicates that the lineage or lineages where eusociality is inferred to have originated exh...
متن کاملTaxonomy Induction Using Hierarchical Random Graphs
This paper presents a novel approach for inducing lexical taxonomies automatically from text. We recast the learning problem as that of inferring a hierarchy from a graph whose nodes represent taxonomic terms and edges their degree of relatedness. Our model takes this graph representation as input and fits a taxonomy to it via combination of a maximum likelihood approach with a Monte Carlo Samp...
متن کاملUsing proximity to compute semantic relatedness in RDF graphs
Extracting the semantic relatedness of terms is an important topic in several areas, including data mining, information retrieval and web recommendation. This paper presents an approach for computing the semantic relatedness of terns in RDF graphs based on the notion of proximity. It proposes a formal definition of proximity in terms of the set paths connecting two concept nodes, and an algorit...
متن کاملPresentation of an efficient automatic short answer grading model based on combination of pseudo relevance feedback and semantic relatedness measures
Automatic short answer grading (ASAG) is the automated process of assessing answers based on natural language using computation methods and machine learning algorithms. Development of large-scale smart education systems on one hand and the importance of assessment as a key factor in the learning process and its confronted challenges, on the other hand, have significantly increased the need for ...
متن کاملPresentation of an efficient automatic short answer grading model based on combination of pseudo relevance feedback and semantic relatedness measures
Automatic short answer grading (ASAG) is the automated process of assessing answers based on natural language using computation methods and machine learning algorithms. Development of large-scale smart education systems on one hand and the importance of assessment as a key factor in the learning process and its confronted challenges, on the other hand, have significantly increased the need for ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009